Talbot County
LIFEBench: Evaluating Length Instruction Following in Large Language Models
Zhang, Wei, Zhou, Zhenhong, Wang, Kun, Fang, Junfeng, Zhang, Yuanhe, Wang, Rui, Zhang, Ge, Li, Xavier, Sun, Li, Lyu, Lingjuan, Liu, Yang, Su, Sen
While large language models (LLMs) can solve PhD-level reasoning problems over long context inputs, they still struggle with a seemingly simpler task: following explicit length instructions-e.g., write a 10,000-word novel. Additionally, models often generate far too short outputs, terminate prematurely, or even refuse the request. Existing benchmarks focus primarily on evaluating generations quality, but often overlook whether the generations meet length constraints. To this end, we introduce Length Instruction Following Evaluation Benchmark (LIFEBench) to comprehensively evaluate LLMs' ability to follow length instructions across diverse tasks and a wide range of specified lengths. LIFEBench consists of 10,800 instances across 4 task categories in both English and Chinese, covering length constraints ranging from 16 to 8192 words. We evaluate 26 widely-used LLMs and find that most models reasonably follow short-length instructions but deteriorate sharply beyond a certain threshold. Surprisingly, almost all models fail to reach the vendor-claimed maximum output lengths in practice, as further confirmed by our evaluations extending up to 32K words. Even long-context LLMs, despite their extended input-output windows, counterintuitively fail to improve length-instructions following. Notably, Reasoning LLMs outperform even specialized long-text generation models, achieving state-of-the-art length following. Overall, LIFEBench uncovers fundamental limitations in current LLMs' length instructions following ability, offering critical insights for future progress.
- Asia > China > Beijing > Beijing (0.04)
- North America > United States > Maryland > Talbot County (0.04)
- North America > United States > District of Columbia > Washington (0.04)
- Asia > Middle East > Jordan (0.04)
- Research Report > New Finding (1.00)
- Overview (0.93)
- Research Report > Experimental Study (0.67)
- Education (1.00)
- Health & Medicine (0.92)
- Information Technology (0.67)
- (2 more...)
Learning to See More: UAS-Guided Super-Resolution of Satellite Imagery for Precision Agriculture
Masrur, Arif, Olsen, Peder A., Adler, Paul R., Jackson, Carlan, Myers, Matthew W., Sedghi, Nathan, Weil, Ray R.
Unmanned Aircraft Systems (UAS) and satellites are key data sources for precision agriculture, yet each presents trade-offs. Satellite data offer broad spatial, temporal, and spectral coverage but lack the resolution needed for many precision farming applications, while UAS provide high spatial detail but are limited by coverage and cost, especially for hyperspectral data. This study presents a novel framework that fuses satellite and UAS imagery using super-resolution methods. By integrating data across spatial, spectral, and temporal domains, we leverage the strengths of both platforms cost-effectively. We use estimation of cover crop biomass and nitrogen (N) as a case study to evaluate our approach. By spectrally extending UAS RGB data to the vegetation red edge and near-infrared regions, we generate high-resolution Sentinel-2 imagery and improve biomass and N estimation accuracy by 18% and 31%, respectively. Our results show that UAS data need only be collected from a subset of fields and time points. Farmers can then 1) enhance the spectral detail of UAS RGB imagery; 2) increase the spatial resolution by using satellite data; and 3) extend these enhancements spatially and across the growing season at the frequency of the satellite flights. Our SRCNN-based spectral extension model shows considerable promise for model transferability over other cropping systems in the Upper and Lower Chesapeake Bay regions. Additionally, it remains effective even when cloud-free satellite data are unavailable, relying solely on the UAS RGB input. The spatial extension model produces better biomass and N predictions than models built on raw UAS RGB images. Once trained with targeted UAS RGB data, the spatial extension model allows farmers to stop repeated UAS flights. While we introduce super-resolution advances, the core contribution is a lightweight and scalable system for affordable on-farm use.
- North America > United States > Virginia (0.24)
- Atlantic Ocean > North Atlantic Ocean > Chesapeake Bay (0.24)
- North America > United States > Maryland > Prince George's County > College Park (0.14)
- (10 more...)
Dr. Frank Rosenblatt Dies at 43; Taught Neurobiology at Cornell - The New York Times
Frank Rosenblatt, associate pro fessor of neurobiology at Cor nell University, died here yes terday in a boating accident. It was his 43d birthday. He lived in Brooktondale, N. Y., an Ithaca suburb. An originator of perception theory, he had developed an experimental machine that could be trained to identify automatically objects or pat terns such as letters of the al phabet. The instrument was an electromechanical device con sisting of a sensory unit of photo cells that viewed the pat tern shown to the machine, as sociation units that contained the machine's memory and re sponse units that displayed vis ually its pattern‐recognition re sponse.
- North America > United States > New York > Westchester County > New Rochelle (0.08)
- North America > United States > Maryland > Talbot County > Easton (0.08)
Lost in Translation
Aaron Hertzman's Viewpoint "Computers Do Not Make Art, People Do," (May 2020, p. 45) makes excellent points as to why it is very unlikely that computers will ever replace artists. While I don't think he quite stated such, it appears to me that he may be of the opinion that replacement of (natural) intelligence (of human beings) with artificial intelligence is very unlikely. Most, if not all, of the endeavors we are addressing are based on digital technology, and possibly cannot replace analog entities. It is unfortunate, however, that with the hype these days, people are either unaware of reality, or simply ignoring reality, with undesirable consequences. I like to cite a voicemail transcription I received recently.
- North America > United States > Illinois > Cook County > Chicago (0.06)
- North America > United States > Maryland > Talbot County > Easton (0.05)
- North America > United States > California > Santa Clara County > Palo Alto (0.05)